Identity by descent estimation with dense genome-wide genotype data.
Authors
Abstract
We present a novel method, IBDLD, for estimating the probability of identity by descent (IBD) for a pair of related individuals at a locus, given dense genotype data and a pedigree of arbitrary size and complexity. IBDLD overcomes the challenges of exact multipoint estimation of IBD in pedigrees of potentially large size and eliminates the difficulty of accommodating the background linkage disequilibrium (LD) that is present in high-density genotype data. We show that IBDLD is much more accurate at estimating the true IBD sharing than methods that remove LD by pruning SNPs, and that it is highly robust to pedigree errors or other forms of misspecified relationships. The method is fast and can be used to estimate the probability for each possible IBD sharing state at every SNP from a high-density genotyping array for hundreds of thousands of pairs of individuals. We use it to estimate point-wise and genome-wide IBD sharing between 185,745 pairs of subjects, all of whom are related through a single, large and complex 13-generation pedigree and genotyped with the Affymetrix 500k chip. We find that we are able to identify the true pedigree relationship for individuals who were misidentified in the collected data and to estimate empirical kinship coefficients that can be used in follow-up QTL mapping studies. IBDLD is implemented as an open-source software package and is freely available.
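As a rough illustration of how point-wise IBD state probabilities relate to genome-wide sharing and an empirical kinship coefficient, the sketch below averages per-SNP probabilities and applies the standard identity kinship = P(IBD=1)/4 + P(IBD=2)/2. It is not the IBDLD implementation: the function names and the input layout (one `(p0, p1, p2)` tuple per SNP) are assumptions made for the example, and an unweighted mean over SNPs is assumed for simplicity.

```python
from typing import Sequence, Tuple

# Hypothetical layout: (P(IBD=0), P(IBD=1), P(IBD=2)) at one SNP for one pair.
IbdProbs = Tuple[float, float, float]


def genomewide_ibd(pointwise: Sequence[IbdProbs]) -> IbdProbs:
    """Average point-wise IBD state probabilities over all SNPs (unweighted)."""
    n = len(pointwise)
    if n == 0:
        raise ValueError("need at least one SNP")
    k0 = sum(p[0] for p in pointwise) / n
    k1 = sum(p[1] for p in pointwise) / n
    k2 = sum(p[2] for p in pointwise) / n
    return (k0, k1, k2)


def kinship(k1: float, k2: float) -> float:
    """Kinship coefficient from the probabilities of sharing 1 or 2 alleles IBD."""
    return 0.25 * k1 + 0.5 * k2


if __name__ == "__main__":
    # Two made-up SNPs for one pair of individuals.
    probs = [(0.0, 1.0, 0.0), (0.5, 0.5, 0.0)]
    k0, k1, k2 = genomewide_ibd(probs)
    print(k0, k1, k2, kinship(k1, k2))
```

Under these assumptions, the expected sharing of full siblings, (0.25, 0.5, 0.25), and of a parent-offspring pair, (0, 1, 0), both give a kinship of 0.25, which is why the full set of IBD state probabilities, rather than kinship alone, is needed to tell such relationships apart.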
Related articles
Inference of Identity-by-Descent in Sib Pairs: Analysis with and without Linkage Disequilibrium
In gene mapping, after an initial genome-wide linkage scan, the next step often involves candidate region studies or fine mapping using dense markers. Dense genotyping, however, introduces linkage disequilibrium (LD). Traditional linkage analysis assuming no LD leads to increased false positive rates. Hence, we develop models to incorporate linkage disequilibrium, focusing on the sib pair desig...
Estimation of pairwise identity by descent from dense genetic marker data in a population sample of haplotypes.
I present a new approach for calculating probabilities of identity by descent for pairs of haplotypes. The approach is based on a joint hidden Markov model for haplotype frequencies and identity by descent (IBD). This model allows for linkage disequilibrium, and the method can be applied to very dense marker data. The method has high power for detecting IBD tracts of genetic length of 1 cM, wit...
Genome-wide identity-by-descent sharing among CEPH siblings.
The concept of genetic identity-by-descent (IBD) has markedly advanced our understanding of the genetic similarity among relatives and triggered a number of developments in epidemiological genetics. However, no empirical measure of this relatedness throughout the whole human genome has yet been published. Analyzing highly polymorphic genetic variations from the Centre d'études du polymorphisme ...
Identity by descent in the mapping of genetic traits
This report shows how the descent of genome from an ancestor to currently observed descendants results in identity by descent (IBD) in current individuals, and hence similarities in their DNA at genetic marker loci. Conversely, data on the marker genotypes of individuals provides inferences of shared descent of genome in current individuals, not just genome-wide, but in specific genome regions....
Genomic regions exhibiting positive selection identified from dense genotype data.
The allele frequency spectrum of polymorphisms in DNA sequences can be used to test for signatures of natural selection that depart from the expected frequency spectrum under the neutral theory. We observed a significant (P = 0.001) correlation between the Tajima's D test statistic in full resequencing data and Tajima's D in a dense, genome-wide data set of genotyped polymorphisms for a set of ...
Journal title:
- Genetic Epidemiology
Volume 35, Issue 6
Pages: -
Publication date: 2011